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This study synthesized the literature on the rates of referral for intervention or assessment of students 
from three racial groups: Caucasian, African American, and Hispanic. Ten studies, published or un- 
published, that presented frequency counts for the population and the referred sample for at least one 
school district yielded 44 comparisons. For comparisons between African American and Caucasian 
students, the mean risk ratios comparing the referral rates were significantly different from zero. No 
significant differences were found between the referral rates of Hispanic students and Caucasian stu- 
dents. Implications, limitations, and directions for future research are presented. 


Determining whether students of different racial or ethnic 
groups are disproportionately identified for special educa- 
tion services is an issue that has been debated for more than 
30 years (see Heller, Holtzman, & Messick, 1982; Reschly, 
1997). Research on this issue has provided valuable data 
documenting the existence and details of the problem, but in 
general this research has not identified specific strategies to 
adequately address the issue. In order to develop strategies and 
materials to reverse and prevent disproportionate representa- 
tion, however, all stages in the process of special education 
eligibility must be examined in order to determine when and 
where disproportionate representation occurs. The methods of 
determining disproportionate representation in special educa- 
tion programs must be examined and extended to the stage of 
referral, which is often considered the most important step in 
the special education eligibility process (Ysseldyke & Algoz- 
zine, 1983). 

Disproportionate Representation 
in Special Education 

In 1968, Lloyd Dunn published an article noting the over- 
representation of minority students and students from low- 
SES backgrounds in special day classes for mild mental 
retardation (MMR). This was one of the earliest and most 
well-known articles citing disproportionate representation in 
special education. It is often cited as a prophetic publication, 
one that occurred at the right time with the right message to 
serve as a catalyst for opening a proverbial floodgate, trigger- 
ing much debate and additional research. In the article, Dunn 
(1968) stated that 60% to 80% of students in special day 
classes for MMR were from “low-status backgrounds” such as 


minority ethnic groups, nonstandard-English-speaking group, 
and non-middle class environments. 

A few years later, Jane Mercer (1973) documented dis- 
proportionate representation of minority students in special ed- 
ucation classes in Riverside, California, using data collected 
in the early to mid-1960s. She determined that the percent- 
ages of different groups placed in mental retardation (MR) 
classes were discrepant. The percentage of Mexican Ameri- 
can students placed in MR classes was four times their per- 
centage in the population. Eor African American students, that 
figure was three times their percentage in the population. 

Jeremy Finn (1982) was the first researcher to examine 
national disproportionate representation trends by using the 
data from the Elementaij and Secondary School Civil Rights 
Compliance Report from the Office for Civil Rights (OCR; 
U.S. Department of Education, Office of Civil Rights, 1994). He 
found that African American students were overrepresented 
in classes for MR and emotional disturbance (ED), Native 
American students were overrepresented in learning disability 
(LD) classes, and Asian American students were underrepre- 
sented in almost every category. Contrary to Mercer’s (1973) 
findings, Hispanic and Caucasian students were found to be 
identified at similar rates. This discrepancy in results may have 
been prompted by the, Diana v. State Board of Education (1970) 
and Guadalupe Organization v. Tempe Elementary School Dis- 
trict No. 3 (1972) court decisions regarding the use of non- 
verbal intelligence tests in the determination of eligibility. 

Since Finn, use of the OCR Compliance Report to exam- 
ine disproportionate representation has been common. Chinn 
and Hughes (1987) compared the results from the 1978, 1980, 
1982, and 1984 OCR surveys. They found that the trends doc- 
umented by Finn continued through the 1980 to 1984 surveys: 
African American students were overrepresented in the MR 
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and ED categories. Native American students were over- 
represented in the LD category, and there was widespread 
underrepresentation of Asian American students. An addi- 
tional comparison that Chinn and Hughes made was that of 
representation in classes for students with gifts and talents 
(GT). In this category, Asian American students were over- 
represented, whereas African American, Hispanic, and Native 
American students were underrepresented. 

The national patterns noted hy Finn (1982) and Chinn and 
Hughes (1987) have been robust and steady over time. More 
recent OCR surveys in 1986 (Reschly & Wilson, 1990), 1990 
(MacMillan & Reschly, 1998; Reschly, 1997), and 1992 (Os- 
wald, Coutinho, Best, & Singh, 1999) have been used to 
verify these patterns. In each updated OCR survey, African 
American students were overrepresented in the MR and ED 
classifications, Native American students were overrepre- 
sented in the LD category, and Asian American students were 
underrepresented in almost every category, with speech- 
language impairment (SLI) generally being the exception 
(Chinn & Hughes, 1987). 

Why Disproportionate Representation 
Is Problematic 

Disproportionate representation is problematic if the program 
in question is ineffective or stigmatizing or if the process of 
identification and placement is not applied equally to differ- 
ent groups of students (Heller et al., 1982). MacMillan and 
Reschly (1998) pointed out that minority representation in pro- 
grams such as Head Start, Follow Through, and Title I is sim- 
ilarly disproportionate to representation in special education, 
yet it has not been criticized as much. In addition, minority 
representation in GT programs has been criticized (Chinn & 
Hughes, 1987; Harry & Anderson, 1994; Patton, 1997), but 
for M«£/e/Tepresentation, not overrepresentation. This is most 
likely due to the view that GT programs are enriching and gen- 
erally positive experiences. So why is disproportionate repre- 
sentation in special education criticized so severely? There are 
three main reasons; labeling ejfects, segregation of placement , 
and presumed ineffectiveness of special education. 

The influence of negative labels is believed to have dele- 
terious effects on the treatment of, placement of, and outcomes 
for students (Patton, 1998). Mehan, Hertweck, and Miehls 
(1986) found that teachers focus on the negative behaviors of 
students who are considered to have behavior problems, even 
if the behaviors are not significantly different from those of 
other students in the same classroom. In the Mehan et al. study, 
students were divided into referred and nonreferred groups. 
Every student in each class exhibited behaviors that the teach- 
ers, when asked to watch a video recording of their classrooms, 
had identified as negative. Interestingly, the teachers identi- 
fied 46.5% of the negative behaviors of the referred students 
but only 13.9% of the negative behaviors of the nonreferred 
group. The authors concluded that the teachers were attempt- 


ing to confirm their decisions regarding referral (i.e., their per- 
ceptions of the student). 

The second problematic area is placement. The least re- 
strictive environment (ERE) mandate of the Individuals with 
Disabilities Education Act (IDEA) states that students with 
disabilities must be educated with nondisabled children “to 
the maximum extent appropriate” (“Assistance to States for 
the Education of Children with Disabilities,” 1999, p. 12457) 
and that they may be removed from the general education en- 
vironment only if they cannot be satisfactorily educated with 
the use of supplementary aids and services. Placements that 
allow maximum access to the general education curriculum 
and contact with nondisabled peers (i.e., the general educa- 
tion classroom) are preferred unless the lEP team decides that 
an alternative placement is required. Serwatka, Deering, and 
Grant (1995) noted that African American students are more 
frequently placed in more segregated settings than are Cauca- 
sian students. This is prominent in higher incidence categories 
such as MMR (Reschly, 1988). Disproportionate placement 
of African American students in more segregated settings also 
occurs in lower incidence categories such as physical impair- 
ment and visual impairment (Serwatka, Dove, Hodge, 1986), 
even though representation rates are not disproportionate in 
these categories. 

Many reasons are given for this disproportionate place- 
ment: lack of a consistent identification process, bias in assess- 
ment instruments, overrepresentation of African American 
students in categories (i.e., MR and ED) that have been tradi- 
tionally served in more segregated settings, and problems re- 
lated to other variables such as poverty (Serwatka et al., 1995). 
The most commonly cited factor, however, is cultural differ- 
ences. Many researchers have argued that there is a specific 
African American behavioral style (Hale-Benson, 1986; Hil- 
liard, 1992). Today’s U.S. teachers are primarily Caucasian 
(90%) and female (66%; U. S. Department of Education, Of- 
fice of Educational Research and Improvement, 1998), which 
can generate conflict between the teacher’s cultural behav- 
ioral expectations and non-Caucasian male students’ behaviors. 
Because teacher tolerance is the prime indicator for identifi- 
cation of behavior problems (Gerber & Semmel, 1984) and 
because teachers are generally less understanding (i.e., toler- 
ant) of behaviors that are not part of their cultural experience 
and expectations (Lambert, Puig, Rowan, Lyubansky, & Win- 
frey, 1998; McIntyre, 1992), students who come from a culture 
other than the teacher’s are at greater risk for being identified 
as having a behavior problem and thus being placed outside 
the general education classroom. 

A third reason disproportionate representation in special 
education is thought to be problematic is because special ed- 
ucation is often presumed to be ineffective (Dunn, 1968; Harry 
& Anderson, 1994). Research into the effectiveness of special 
education has been inconclusive and occasionally contradic- 
tory (Carlberg & Kavale, 1980; Wang & Baker, 1985-1986; 
both as cited in Artiles & Trent, 1994). There is also evidence 
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that different programs and placements may be differentially 
effective (Kavale & Forness, 1999). 

Methods for Determining 
Disproportionate Representation 

Although there is little disagreement that disproportionate rep- 
resentation can be problematic, several methods have been 
used to demonstrate differential representation of various groups 
in special education. Determining the extent of dispropor- 
tionate representation is important to establishing how large 
a problem the issue is (or isn’t). The first approach used, in 
Dunn’s (1968) classic article, was his “best judgment.” Dunn’s 
figure of 60% to 80% of special day-class students being from 
minority or low-SES backgrounds was deduced from his years 
of experience and observation. One person’s judgment (even if 
supported by other people’s judgment) is not a reliable method 
for estimating the scope of a problem. This is especially true 
when the stakes, such as funding and civil rights issues, are 
as high as in the area of disproportionate representation. 

Another approach that has been used is comparing the 
percentage of students from different racial groups within a 
certain category or placement. This has been termed percent 
of category by group, or the composition index (MacMillan & 
Reschly, 1998; Reschly, 1997). Chinn and Hughes (1987) and 
Mercer (1973) used this method. For example, Chinn and 
Hughes reported that in 1978, 45.3% of the MMR population 
was African American, whereas only 20.1% of the total pop- 
ulation was. Although this method appears to present the fig- 
ures in a straightforward manner, it has been criticized as 
actually inflating the appearance of disproportion. Reschly 
(1997) cited the example that 85% of elementary teachers are 
women, yet the overwhelming majority of women are not ele- 
mentary teachers. In other words, even though the composi- 
tion index is high, the percentage of women in the group is low 
and may provide a more accurate picture of the issue. 

Percentage of group in a category or placement, also often 
termed the risk index, is another method for presenting over- 
representation results (Reschly, 1997). It is calculated by di- 
viding the number of students of a certain group (e.g., African 
American) in a certain category or placement (e.g., LD) by the 
total number of students in that group. An example of this ap- 
proach can be found in MacMillan and Reschly (1998). In 
1990, according to the OCR Compliance Report, nearly 35% 
of students classified as MMR were African American, but 
only 2% of African American students were classified as MMR. 
Obviously, different methods of calculation provide different 
pictures of the severity of the same issue (see MacMillan & 
Reschly for a more thorough comparison of the composition 
index and the risk index). 

Rather than rely solely on the risk index, some researchers 
have explored other approaches for comparing groups. The first 
alternate approach was used by Finn (1982), who calculated 
the odds of a student of a certain group (e.g., African Ameri- 


can) being identified in a certain category (e.g., MR). In other 
words, Finn took the number of students of that group in that 
category and divided that number by the number of students 
in that group not in that category. The resulting number is then 
divided by the odds of students of all other groups being iden- 
tified for the same category. This measure is one form of a 
statistic known as the odds ratio (Finn, 1982; Lipsey Wil- 
son, 2000). Although this form of odds ratio provides a con- 
sistent measure of the likelihood of a certain event, it fails to 
include a measure of the risk index. It also provides only a bi- 
nary comparison of one group (i.e., members vs. nonmembers), 
rather than a direct comparison of groups. 

In order to remedy these shortcomings of the odds ratio, 
other researchers (Fleiss, 1994; Kleinbaum, Kupper, Muller, 
& Nizam, 1998) have proposed a method known as the rate 
ratio, which is also referred to as relative risk. Relative risk 
compares the risk index for one group to the risk index for an- 
other group (or the total population; see Coutinho & Oswald, 
1998, for an example). In addition to addressing the problems 
of the risk index and direct comparison of groups, current ad- 
vances in meta-analytic statistical approaches allow for sta- 
tistical significance testing of the findings. Not only does this 
type of measure address the issues that have been of concern 
for the past 30 years, but it also allows for quantitative analy- 
sis and interpretation in addition to the previously used, more 
qualitative judgments. For example, the relative risk of a 
school district that has African American and Caucasian spe- 
cial education identification rates of 14% and 12%, respec- 
tively, is 1.17. A rate ratio of 1.00 would indicate that the 
individual risk for one group is identical to the risk index for 
another. 

An issue that is still pertinent to both the odds ratio and 
the rate ratio is determining the referent group to be used (i.e., 
the denominator used in the equation). There are three main 
methods of calculating this denominator; 

1 . use the odds or rate for all students not in the 
target groups, 

2. use the odds or rate for all students in the pop- 
ulation of interest, or 

3. use a consistent group for comparison. 

The first method was used by Finn (1982). As already stated, 
this method does not include a direct comparison of groups 
because the composition of the denominator changes for each 
target group (e.g., African American, Hispanic). The influence 
of large groups (e.g., Caucasians) on the denominator will be 
greater than that of smaller groups (e.g.. Native American). 

The second method addresses one shortcoming of the 
first — it provides a common denominator. Using the odds or 
rate for the total population of interest as the denominator en- 
sures that it will be consistent across comparisons. An addi- 
tional problem is that the numerator of the equation (i.e., the 
target group) is included in the denominator. As the size of 
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the target group increases, the dependency of these data in- 
creases. This means that the magnitude of the ratio may, in 
part, depend on the size of the target group. 

The third method of calculating the denominator is to use 
a consistent group as the comparison, for example, using Cau- 
casian students as the comparison group for all target groups 
(e.g., African American and Hispanic students). This provides 
direct comparison of groups, offers a common denominator, 
and eliminates dependency in the formula. One assumption of 
this method, however, is that the comparison group is accu- 
rate or “true.” When using Caucasian students as the compar- 
ison group (as is generally the case, hy reasoning that they are 
the largest race/ethnicity group in the United States), the im- 
plicit assumption is that the odds or rate of identification for 
Caucasian students is appropriate or accurate. When using this 
method of denominator, it may be important to not assume di- 
rection of causation or effect. A discrepancy between the iden- 
tification rates for African American and Caucasian students 
may be an indication of ove/ representation of one group or 
(i«c/e; representation of the other. It should be considered only 
as the first step In the investigation. Further steps examining 
the effect of intervening variables must also be conducted. 

Application of Relative Risk 
to National Databases 

The disproportionate representation of racial groups in differ- 
ent categories of special education eligibility has often been 
established using large databases such as the OCR Compli- 
ance Report. This database was first established in 1968 as a 
means of monitoring demographic patterns in school districts 
that previously had civil rights violations or who applied for 
funds under the Emergency School Aid Act, as well as pro- 
viding a sample representative of other districts across the coun- 
try on 13 variables important to OCR (Finn, 1982). 

Another large national database is that maintained by the 
Office of Special Education Programs (OSEP) for the 1998- 
1999 school year (U.S. Department of Education, 2000). In- 
formation from this database also indicates overall patterns of 
disproportionate representation. One can use the “All Dis- 
abilities” eligibility frequencies for students ages 6 to 21 years 
from Table AA3 (p. A-5) and U.S. student population race/ 
ethnicity frequencies from the Digest of Education Statistics 
(available online at http://nces.ed.gov/pubs2001/digest/dt039. 
html) in the equation for a rate ratio. The result indicates that 
African American students were found eligible for special ed- 
ucation at a rate 1.16 times that for Caucasian students (13.97% 
vs. 11.99%). Although this number may not appear extraor- 
dinary in isolation, when applied to an aggregation of 1 mil- 
lion students (approximately 1 out of every 50 U.S. students), 
160,000 more African American students than Caucasian stu- 
dents are expected to be placed in special education programs. 
For Hispanic students, the rate was .92 when compared to 
Caucasian students. This indicates that Hispanic students are 
identified as eligible for special education services at a lower 


rate than are Caucasian students (i.e., 80,000 fewer Hispanic 
students than Caucasian students per million). These figures, 
however, are given for the overall rate of eligibility. Because 
representation is differentially disproportionate in different 
categories, the rate ratios for individual categories of eligibil- 
ity can be even more alarming. 

The OCR Compliance Report also provides data on eli- 
gibility rates for different racial groups in the high-incidence 
categories of special education eligibility. For the category of 
MR, which has been the primary focus of much of the liti- 
gation and research on disproportionate representation (for a 
review, see Reschly, 1988), African American students were 
found to be eligible at 2.23 times the rate of Caucasian stu- 
dents, whereas Hispanic students were identified at .65 times 
the rate of Caucasian students. In the category of emotional 
disturbance (ED), African American students were classified 
at 1.68 times the rate of Caucasian students, while Hispanic 
students were classified at .65 times the rate of Caucasian stu- 
dents. In contrast to these large differences, the rates for the 
category of learning disability (LD) are more similar. African 
American students were classified at 1.11 times the rate for 
Caucasian students, and Hispanic students were classified at 
1.08 times the rate. 

Although the reliability and validity of the large-scale 
databases has been questioned (see Finn, 1982; Reschly, 1997), 
the conclusions drawn from the information in them are gen- 
erally agreed to be accurate. What is missing from this dis- 
cussion, however, is an analysis of the variables that produce 
or are related to the overrepresentation patterns. For exam- 
ple, a number of variables — such as overall achievement rates, 
teachers’ misperception of behavior, and test biases — may be 
involved. For a variety of reasons, referral rates are one of the 
most important variables related to understanding overrepre- 
sentation. 

Importance of Referral 

Referral for assessment or intervention has been cited as one 
of the most important predictors of future special education 
eligibility (Artiles & Trent, 1994; Mehan et ah, 1986; Yssel- 
dyke & Algozzine, 1983) because most students referred for 
consideration of special education are eventually placed in 
special education programs. This phenomenon is open to dif- 
fering interpretations. Some researchers have suggested that 
teachers are accurate judges regarding which students need 
help. For example, Gresham, Reschly, and Carey (1987) found 
that teachers’ judgments of student achievement (as measured 
by the Teacher Rating of Academic Performance) were as ac- 
curate as measures of intelligence (Wechsler Intelligence Scale 
for Children-Revised; Wechsler, 1974) and achievement {Pea- 
body Individual Achievement Test; Dunn & Markwardt, 1970). 
One potential limitation of this study, however, was the use of 
a reevaluation sample (i.e., students in the LD group had al- 
ready been in special education for nearly 3 years). To address 
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this limitation, Gresham, MacMillan, and Bocian (1997) con- 
ducted a similar study using students being initially referred 
for evaluation. Their results corroborated the accuracy of 
teacher judgments. More recently, Hecht and Greenfield (2001) 
found that teachers’ ratings of student reading performance 
predicted future performance with an accuracy similar to that 
of reading-related tests. 

A competing suggestion for interpretation of the strong 
relationship between referral and eligibility is the presence of 
confirmato?y bias (O’Reilly, Northcraft, & Sabers, 1989), a 
term that refers to the tendency of an evaluator or team to draw 
conclusions that align with preliminary hypotheses despite the 
presence of contradictory evidence or a lack of sufficient con- 
firmatory evidence (Darley & Gross, 1983). Using vignettes, 
O’Reilly et al. demonstrated that school psychologists are 
likely to agree with a reason for referral despite contradictory 
evidence. Kastner and Gottlieb (1991) demonstrated that the 
reason for referral is a strong predictor of classification cate- 
gory. This strong correlation (in the face of conflicting data) 
may be a result of (a) expectancy effects (i.e., a tendency to find 
that for which one is looking or for expectations to be “re- 
ified”; Rosenthal, 1991) or (b) illusory correlation (i.e., an 
assumption of the existence of an explanatory variable [e.g., 
neurological deficits causing poor academic performance] 
based on chance co-occurrence or an unmeasured variable; 
Gnys, Willis, & Faust, 1995). 

In addition, if different groups are referred at different 
rates, any differences between the groups within the referred 
sample may be due to a larger proportion of one group being 
included in the sample. For example, if 23 % of African Amer- 
ican students in a district were referred but only 9% of Cau- 
casian students were, differences between these two referral 
groups may be attributable to differential or biased selection 
criteria, differences in the underlying distribution of the trait 
for the two groups, differences regarding related variables, or 
other causes. Difference in referral rates has implications for 
(a) whether the same criteria for referral are being used for all 
students and (b) whether other variables are involved in dif- 
ferences in groups’ performance. 

The relationship between eligibility and referral suggests 
that a better understanding of disproportionate representation 
in special education categorization requires investigation of 
factors affecting referral rates and processes. There are two 
primary reasons that less attention has been directed toward re- 
ferral than toward eligibility. First, most referral bias research 
is analogue, using reactions to vignettes in order to control for 
the variables of race (Zucker & Prieto, 1977), gender (Greg- 
ory, 1977), or SES (Lanier, 1975) while keeping other vari- 
ables, such as achievement, consistent. Although vignettes and 
simulated cases have been instrumental in adding to the knowl- 
edge base, the generalization of the results of studies employ- 
ing them has been questioned (Bahr, Fuchs, Stecker, & Fuchs, 
1991; Shinn, Tindal, & Spira, 1987). 

The second reason that there is less research on referral 
than on eligibility is that most studies comparing eligibility 


rates of various groups use large national databases. Referral 
data (disaggregated by racial group or gender) are not col- 
lected for such large databases as OCR’s Compliance Report 
or the U.S. Department of Education’s Annual Report to Con- 
gress on the Implementation of IDEA. In order to compare eli- 
gibility rates to referral rates, they should be in the same form 
(i.e., a rate ratio) and drawn from similar populations. Lew 
studies have actually been conducted that examined referral 
rates by comparing them to population rates. In addition, all 
of these studies have been conducted within individual school 
districts. The only source of data in a consistent form on sim- 
ilar populations for a large-scale analysis thus is a synthesis 
of the results from individual studies. 

Purpose of the Study 

The present study synthesized the results of individual stud- 
ies in order to compare referral rates to population rates of 
students from different racial backgrounds. Quantitative syn- 
thesis of this research may allow for a better understanding of 
overall referral rates and the processes involved. This will pro- 
vide a basis for future research aimed at identifying or ruling 
out various factors of risk of bias. Our hypothesis was that the 
referral rates of the different groups are similar to the eligi- 
bility rates that have been reported. In other words, African 
American students are referred at a higher rate than are Cau- 
casian students, and Caucasian and Hispanic students are re- 
ferred at similar overall rates. 

Method 

Criteria for Eligibility of Studies 

Studies were selected on the basis of criteria intended to pro- 
vide a comprehensive view of the samples used in research on 
overrepresentation. The eligibility criteria were as follows. 

Distinguishing features meant that a study had to report 
the frequencies or proportions of racial groups within a sam- 
ple of students referred for assessment or Intervention as well 
as the population from which the sample was drawn. The sam- 
ple could be (a) the entire set of referrals in a given school 
or district for a given time period or (b) a random sample of 
the entire set. Eligible studies included referrals for psychoed- 
ucational assessment, prereferral assistance, or special edu- 
cation eligibility. Office referrals, disciplinary referrals, and 
referrals solely for related services such as counseling, health, 
or speech services were not eligible. Eligible studies had to 
use school populations for the sample, not populations in hos- 
pitals, university clinics, and correctional institutions. 

Research respondents meant a study had to involve stu- 
dents In elementary, middle, and/or secondary settings (i.e., 
students ages 5-2 1 years without a high school diploma or equiv- 
alent). Public or private school settings were eligible, provided 
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they served students in this range. Preschool, early childhood, 
or postsecondary schools were not eligible. A study had to 
also include multiple racial groups (e.g., African Americans 
and Caucasians). 

Research methods meant a study had to include quanti- 
tative demographic information regarding the racial compo- 
sition of a specific referred sample and the population from 
which it was drawn. Vignettes, case studies, opinion papers, and 
literature reviews were not eligible. 

Cultural and linguistic range meant a study had to be con- 
ducted in the United States. Choosing only one country was 
important because systems of and criteria for referral for eval- 
uation and intervention vary from country to country. Because 
we sought to study the process in the United States, studies 
conducted in other countries were excluded. 

Time frame meant the data for the studies had to have 
been collected after 1975, which was the year that the Educa- 
tion for All Handicapped Children Act of 1975 (Pub. L. 94-142) 
was passed. This law set the stage for special and remedial 
education as it is practiced today; hence, any data gathered 
after this date were within the context of the Act. 

Publication type meant published and unpublished stud- 
ies were eligible, regardless of source. This included, but was 
not limited to, books, book chapters, refereed journal articles, 
nonrefereed journal articles, dissertations, technical reports, 
conference papers, and unpublished manuscripts. 

Identification and Retrieval 
of Eligible Studies 

The entire population of studies, both published and unpub- 
lished, that met the eligibility criteria was targeted for retrieval. 
The data for the current meta-analysis were identified using a 
variety of sources and three general methods. The first method 
involved keyword bibliographic searches of computerized 
databases, including Educational Resources Information Cen- 
ter (ERIC), Psycinfo, Social Sciences Abstracts, Education 
Abstracts, and Dissertation Abstracts International . The sec- 
ond method was manual searches of popular refereed journals 
in the fields of special education and school psychology. The 
journals we searched ss/ere Exceptional Children (1975-2000), 
Journal of Negro Education (1975-2000), Journal of School 
Psychology (1980-2000), Psychology in the Schools (1975- 
2000), School Psychology Quarterly (1980-2000), and School 
Psychology Review (1975-2000). The third method was to ex- 
amine the references of retrieved studies for additional po- 
tentially eligible studies. 

The initial search identified more than 1,500 citations 
that potentially could contain the desired information. After 
reviewing the abstracts, we determined that 121 of these doc- 
uments met all of the inclusion criteria, and we targeted these 
for retrieval. All 121 were retrieved through a library search 
or via interlibrary loan. 

Only 9 of the 121 retrieved studies contained the data re- 
quired for inclusion. One additional report, however, presented 
New York City population proportions but not the frequencies 


(Gottlieb & Alter, 1994). The total enrollment for the 32 New 
York City Community School Districts (CSD) was presented; 
however, the enrollment for each CSD was not. Using the 
School Based Budget Reports (Crew, 1999), we were able to 
determine the proportion of enrollment of the 32 CSDs within 
the Board of Education. These proportions were then applied 
to the total reported enrollment to derive the frequencies of 
students of each racial group. This creates a degree of uncer- 
tainty because the budget report data were from the 1997- 
1998 school year, whereas the Gottlieb and Alter data were 
from 1992-1993. These proportions may vary significantly 
over time and thus must be considered an approximation. This 
allowed for 10 usable studies yielding 44 comparisons. 

Coding System 

The first author coded the data. Each study was coded twice, 
with a minimum of 2 weeks between codings to ensure con- 
sistency of the coding procedure. Intracoder agreement was 
100%. This high degree of consistency probably was aided by 
the low amount of necessary inference in the coding process. 
In addition, the second author coded 12 of the 44 comparisons 
(27%), and intercoder agreement was 99.3%. 

Data were coded in two general categories of variables: 
study level and effect-size level. At the study level, four 
variables were coded: publication year, publication type (e.g., 
journal article, dissertation), state in which the study was con- 
ducted, and length of the study. At the effect-size level, 30 vari- 
ables were coded. Twelve of these variables were not included 
in any study, and 3 were consistent across all studies. These 
15 variables therefore were eliminated from subsequent analy- 
ses. The 15 remaining variables included school level (e.g., 
elementary, middle), total size of the referred sample, and which 
racial group was predominant in the population. Also coded 
were the frequencies and/or proportions of African American 
(AA), Caucasian (C), and Hispanic (H) students in the re- 
ferred sample and in the overall population. Eligible studies 
and their characteristics are listed in Table 1 . 

Data Analysis 

The desired analysis was a comparison of the rates at which 
each group was referred (i.e., the percentage of the popula- 
tion of the group that is referred). Once the data from the in- 
cluded studies were coded, effect sizes were calculated. Due 
to the nature of the data (i.e., frequency counts across two 
factors) and to compare the results to studies of eligibility 
rates, the rate ratio was used as the effect-size statistic (here- 
after, the terms effect size and rate ratio are used inter- 
changeably). The rate ratio is used for comparing two groups 
across levels of a dichotomous variable (Eleiss, 1994). For this 
meta-analysis, the referred sample and the population served 
as the levels of the dichotomous variable (the groups being 
represented by racial groups). The equation used to calculate 
the rate ratio for each two-group comparison thus was as 
follows: 
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frequency of Group A in referred sample 

frequency of Group A in population 

ES = Rate Ratio = 

frequency of Group B in referred sample 

frequency of Group B in population 

This equation yielded a statistic comparing the referral rates 
of two groups. 

Because the rate ratio does not allow comparison across 
three levels of a variable, separate rate ratios were calculated 
comparing African American and Caucasian students (AA to 
C) and Hispanic to Caucasian students (H to C). Caucasian stu- 
dents were used as the comparison group because they repre- 
sent the largest group nationwide. This means that most studies 
were expected to yield two effect sizes, one for each compar- 
ison, to be used in separate analyses. Unfortunately, only three 
studies included frequency counts for Hispanic students (Ar- 
gulewicz & Sanchez, 1983; Farrell, Olson, Malloy, & Boykin, 
1983; Gottlieb & Alter, 1994). Therefore, a total of 44 effects 
sizes were included in the AA to C comparison, whereas 35 
were included in the H to C comparison. 

Before the comparisons could be made, however, some 
adjustments were required. First, because the rate ratio is sta- 
tistically inconvenient (i.e., centered on 1, with a minimum 
value of 0 and no maximum value; Lipsey & Wilson, 2000), 
the natural log of each effect size was used in all analyses. 
The logged rate ratio has an approximately normal distribu- 
tion (M = 0, SD = 1.83), which allows it to be presented in a 
form similar to other measures of effect size and facilitates 
the calculation of the standard error (Lipsey & Wilson, 2000). 

Second, greater weight was given to effect sizes based 
upon the inverse of the sampling error variance to account for 
the various sample sizes that the effect sizes were based upon. 
For the rate ratio, the equation for the sampling error was as 
follows: 


I-Pa + 1-Pb 

# Group A in population (P^) # Group B in population (F^) 

where P = the risk index for a group (i.e., frequency of group 
in category /frequency of group in population. The weight as- 
cribed to each effect size is then 1 divided by this number (the 
sampling error) squared (Lipsey & Wilson, 2000). 

The first step in the analysis was to plot both sets of ef- 
fect sizes using histograms and box plots to determine if there 
were any outliers. Once the distributions were determined to 
be approximately normal, the mean and the confidence inter- 
val (Cl) of each effect-size distribution were calculated. A test 
of the homogeneity of each distribution was also conducted. 
If the homogeneity results are significant (i.e., Q [the homo- 
geneity statistic] exceeds the upper tail critical value of the 
chi-square distribution at k -1 degrees of freedom), there is sig- 
nificant variability in the distribution of effect sizes. Descriptive 
study characteristics (e.g., predominant group in the district, 
state in which the district is located) can then be used to ex- 
plain portions of this variance. 


Results 

Table 2 presents the characteristics of the comparisons from 
the studies in the meta-analysis. As can be inferred, the report 
(Gottlieb & Alter, 1994) that yielded 32 of the 44 comparisons 
(73%) provided consistent data across all of its comparisons. 
The variability among the comparisons from the Gottlieb and 
Alter report was tested and found to be significant for both 
the AA to C (2 = 216.74, p < .00005) and the H to C (Q = 
201 31, p < .00005) comparisons. This is potentially problem- 
atic because to partial out any variability among the compar- 
isons of that study, a variable is needed to discriminate among 
them. If a variable is constant across the 32 subsamples, it 
cannot be used to partial out that variability. The predominant 
racial group variable (i.e., which racial group constituted the 
largest percentage of the population) was the only variable 
coded that varied among the comparisons in the Gottlieb and 
Alter study; therefore, it was used as the explanatory variable 
in the analyses. 

Also noteworthy is that although the logged rate ratio was 
used in all analyses, the scores presented in this and all sub- 
sequent sections have been transformed (i.e., unlogged) and are 
presented in their original form (i.e., the rate ratio; see above). 
This was done to facilitate discussion of the findings. The rate 
ratio is interpreted as the referral rate of the first group as 
compared to the second group. For example, in the African 
American to Caucasian comparison, a rate ratio of 1 .5 would 
indicate that African Americans were referred at 1.5 times the 
rate of Caucasians (i.e., given their proportion of the popula- 
tion, African Americans were one-and-one-half times more 
likely to be referred). 

African American 
to Caucasian Comparison 

Visual inspection of the plots was used to determine that the 
effect sizes were approximately normally distributed. The 
44 subsamples with African American and Caucasian fre- 
quencies yielded a fixed-effects model mean effect size of 1 .58 
(confidence interval from 1.53 to 1.63). An analysis of ho- 
mogeneity, however, yielded a Q of 628.71 {p < .0001). An 
analog to the analysis of variance (Lipsey & Wilson, 2000) 
was used to determine if predominant racial group in the pop- 
ulation accounted for the within-group variability. Because 
the results of this analysis revealed that predominant racial 
group did not sufficiently account for the variability within 
groups (within-Q = 519.05,p < .0001), a random-effects model 
was used. This type of model accounts for the unexplained 
variance by inserting a constant (an estimation of the common 
error) into the regression equations (Raudenbush, 1994). Be- 
cause the unexplained variance is accounted for statistically, 
this method is more conservative than a fixed-effects model, 
which does not address the unexplained variance (Berlin, Laird, 
Sacks, & Chalmers, 1989). This allows evaluation of the ef- 
fect that can be attributed primarily to the grouping variable 
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TABLE 2. Characteristics of Included Subsamples 


Variable 

n 

o/ 

/o 

Publication type 

Journal article 

8 

18 

Dissertation 

2 

5 

Technical report 

34 

77 

Publication year 

1978 

1 

2 

1983 

3 

7 

1990 

1 

2 

1994 

32 

73 

1995 

1 

2 

1996 

1 

2 

1997 

2 

5 

1998 

1 

2 

1999 

2 

5 

State conducted in 

New York 

32 

73 

Other than New York 

12 

27 

Length of study (in months) 

11 

41 

93 

24 

1 

2 

Setting 

Elementary school 

6 

14 

Elementary & middle school 

33 

75 

All levels 

5 

11 

Predominant racial group 

Caucasian 

16 

36 

African American 

15 

34 

Hispanic 

13 

30 


Note. Due to rounding, some percentages may not add to 100. 


of interest (in this case, predominant racial group) because the 
unexplained variability (i.e., the error) is controlled for. 

With this accounted-for variability (within-Q = 49. 19, p = 
.18), the between-groups effect was found to be nonsignifi- 
cant (between-Q = 1.42, p = .49). The referral rate of African 
American students, as compared to Caucasian students, was 
not significantly influenced by which racial group was pre- 
dominant in the population under study. Examination of the 
overall effect, however, revealed a significant discrepancy be- 
tween the referral rates of African American and Caucasian 
students {M = 1.32, p = .0005). 

Hispanic to Caucasian Comparison 

Visual inspection of the plots was used to determine that the 
effect sizes were approximately normally distributed. The 35 
subsamples with Hispanic and Caucasian frequencies yielded 
a fixed-effects model mean effect size of 1 .05 (confidence in- 
terval from 1.02 to 1.09). An analysis of homogeneity, however, 
yielded a Q of 410.61 (p < .0001). Attempts at partialing out 


the Q by means of an analog to the ANOVA using predomi- 
nant racial group was unsuccessful (within-Q = 391.46, p < 
.0001). As a result, a random-effects model was adopted that 
accounted for the within-group variability (Q = 38.82, p = .19) 
but yielded a nonsignificant between-group difference {Q = 
1.97, p = .37). Examination of the overall effect size also re- 
vealed a nonsignificant difference {M = 1.06,p = .49). The re- 
ferral rates for Caucasian and Hispanic students were found 
to be similar. 

Discussion 

The results indicated that the referral rates of racial groups 
vary significantly. The initial fixed-effects models indicated 
that the rate of referral was greater for both African Americans 
and Hispanics as compared to Caucasians. Because no inter- 
vening variables were identified that mediated these referral 
rate differences, the variability within each set of comparisons 
was statistically accounted for (i.e., random-effects models 
were used). Although the random-effects African American to 
Caucasian comparison was significant despite the larger confi- 
dence interval, the Hispanic to Caucasian comparison was not. 

Compared to the national rates for eligibility stated pre- 
viously, the referral rates identified by the present analysis in- 
dicate higher rates for both African American and Hispanic 
students. The OSEP-reported eligibility rate for African Amer- 
ican students, compared to that for Caucasian students, was 
1.18 versus a mean referral rate of 1 .32. Eor Hispanic students 
compared to Caucasian students, the OSEP-reported eligibil- 
ity rate of .89 was lower than the mean referral rate of 1.06. Eor 
example, these rates indicated that for every 100 Caucasian stu- 
dents who are referred for assessment or intervention, 132 Af- 
rican American students and 106 Hispanic students are 
referred. In contrast, for every 100 Caucasian students found 
to be eligible for special education, 1 18 African American stu- 
dents and 89 Hispanic students are found to be eligible. This 
indicates that (a) more African American students and a sim- 
ilar number of Hispanic students are referred for special edu- 
cation and (b) more African American students but fewer 
Hispanic students than Caucasian students are found to be eli- 
gible for special education. 

Because the operational definition of referral for this 
analysis included referrals for assessment or intervention, stu- 
dents who ultimately were provided with services through 
means other than special education (e.g.. Title I, Bilingual or 
English Language Learner [ELL] services) appear in the re- 
ferral data but not in the special education classification data. 
Differences in referral and eligibility rates may be explained by 
disproportionate numbers of African American and Hispanic 
students served by Title I and ELL, respectively (MacMillan 
& Reschly, 1998). For example, consider the referral rate of 
100 Caucasian students versus the referral rate of 132 African 
American students. If the base eligibility rate (from the re- 
ferred sample) for Caucasian students were 60%, the corre- 
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spending rate for African American students would be 60% 
times 1.18 (the ratio of African American eligibility rate com- 
pared to Caucasian rate), or 70.8% (94 of the 132 students). 
In addition to the 94 students found to be eligible for special 
education, 10 students might be provided services through 
Title I or a district-wide remedial reading program. Ineligi- 
bility for special education services does not automatically 
equate to inappropriate referral — especially when other out- 
comes (e.g.. Title I) also may indicate appropriate referral. 
Without eligibility rates for Title I and other alternate reme- 
dial services disaggregated by race, an exact account of over- 
all eligibility rates to compare to referral rates is not possible. 
The rest of this discussion thus primarily addresses potential 
sources of variability in rates of referral between different 
groups, acknowledging that referral, assessment, eligibility, 
and placement are interrelated phases of one process. 

The most common reason to refer a student for assess- 
ment or intervention is low achievement (Gottlieb & Alter, 1 994; 
Hyde, 1975; Kastner & Gottlieb, 1991). Because an adverse 
impact on educational performance is an important indicator 
of the need for special education services, this is not surpris- 
ing. Referrals for low achievement most likely culminate in 
classification as MR or LD — depending upon performance on 
achievement and intelligence tests. Another common reason 
for referral, however, is disraptive or disturbing behavior (M. G. 
Anderson, 1994; Hyde, 1975). There is limited evidence that 
African American students may be referred more than Cau- 
casian students for behavior issues (Del’Homme, Kasari, For- 
ness, & Bagley, 1996). Clearly, more research is needed in this 
area; however, several claims have been made that this dis- 
crepancy in referral reason exists and may be rooted in cultural 
differences and/or perceptions of higher rates of behavioral 
difficulties (Harry & Anderson, 1994; Hilliard, 1992; Serwatka 
et ah, 1995). 

Researchers have demonstrated that teachers’ perceptions 
of appropriate behaviors (Rong, 1996) and deviant behaviors 
(Puig et ah, 1999; Weisz et ah, 1988) are culturally defined. In 
addition, it has been demonstrated that teachers may perceive 
the same behavior differently in students of different ethnic- 
ity from themselves (Carlson & Stephens, 1986) or in students 
considered “bad” rather than “good” (Mehan et ah, 1986). 
Given the fact that almost 90% of teachers nationwide are Cau- 
casian (U.S. Department of Education, 1998), the possibility 
exists for cultural misunderstanding or mismatch of expecta- 
tions that might affect referral rates of non-Caucasian students. 

Another implication of the relation between referral and 
eligibility rates is support for the presence of confirmatory 
bias (Darley & Gross, 1983). When a student is referred for 
assessment of special education eligibility, a hypothesis is 
made that the child has a disability and needs special educa- 
tion services (Walker, 1976, as cited in O’Reilly et al., 1989). 
Several mechanisms for confirming this hypothesis have been 
demonstrated in the special education eligibility process. 
These include selective data gathering (Snyder & Gangestad, 


1983), selective data use (Danger & Abelson, 1972), and ig- 
norance of disconfirming data (Nisbett & Ross, 1980). Other 
factors that may contribute to confirmatory bias include the 
differential interpretation of behavior, depending on student 
ethnicity (Carlson & Stephens, 1986) and cultural expecta- 
tions (Collier & Hoover, 1987; Harry & Anderson, 1994), as 
has already been mentioned. These may lead to the creation 
of additional a priori hypotheses for the evaluator or multi- 
disciplinary team members. These hypotheses will then be 
subject to conscious or unconscious strategies of confirma- 
tion. Confirmatory bias, however, is only problematic in in- 
stances of inappropriate referral or negative outcomes. 

When the assessment process confirms an accurate hy- 
pothesis (e.g., a student’s achievement is significantly behind 
his or her ability, and he or she is therefore is determined to 
have a learning disability), no bias can be identified because 
the “biased” decision and the “correct” decision are the same. 
When a student is inappropriately referred, the biased deci- 
sion and the correct decision are at odds and confirmatory bias 
might exist. In the context of disproportionate representation, 
if students of different racial groups are inappropriately re- 
ferred at similar rates, then confirmatory bias is a generic 
problem to address. If students of different racial groups are 
inappropriately referred at different rates, however, there is in- 
equality of process or treatment (Reschly, 1997), and address- 
ing confirmatory bias may be considered in terms of a civil 
rights issue as well as a generic problem. 

The issue of appropriate versus inappropriate referral 
revolves around the outcomes of that referral. As stated previ- 
ously, referral outcomes other than special education eligibil- 
ity or ineligibility are possible. Students may receive services 
through a Section 504 plan, Title I, a district-wide interven- 
tion program (e.g., Reading Recoveij), or supplementary ser- 
vices (e.g., individual or group counseling). Provided the 
student’s needs have been accurately identified and are ad- 
dressed through these services, the outcome of the referral 
should be viewed as positive. As stated previously, dispro- 
portionate representation in special education is often viewed 
as negative because special education is seen as ineffective, 
or worse, detrimental to students’ development. Conversely, 
if special education is viewed as a positive service that ad- 
dresses each student’s individual needs, representation in spe- 
cial education should be proportional to the needs of different 
groups, not their percentage of the population. Because Af- 
rican American and Hispanic students have been demon- 
strated to perform more poorly than their Caucasian peers on 
academic measures (Meece Kurtz-Costes, 2001) and often 
come from more economically disadvantaged backgrounds 
(Kennedy, Jung, & Orland, 1986), some critics have argued 
that their rates of referral and eligibility for special education 
should be higher than the same rates for Caucasian students 
(Forness & Knitzer, 1992; Kauffman, 1999; Oswald, 1995). 
This position is also based on one of the following two 
premises: 
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1 . The assessment measures and selection criteria 
do not bias the outcome in negative ways. 

2. Bias is unimportant because it is better to pro- 
vide special education to a child who does not 
need it rather than not provide services to a 
child who needs them. 

This meta-analysis provides support for the variability 
of referral rates among different groups of students. It also 
highlights the importance of referral in future classification in 
special education categories. These results do need to be con- 
sidered, however, in light of some limitations. 

Limitations 

The number of districts represented in the present study is 
small compared to the total number in the United States. The 
degree to which a sample is representative of the population 
being generalized to is negatively correlated to sample size 
(Glass & Hopkins, 1996). Of the more than 16,000 school dis- 
tricts in the United States today, only 42 were represented in 
this study (two districts were represented twice). With such a 
small sample, the potential error in sampling is great. The 
sampling of these districts is not random either because each 
consented to some degree to participation in a research study. 
In addition, the districts included in the sample were (a) ex- 
clusively from predominantly urban or suburban areas, (b) all of 
medium to large size — only 3 having fewer than 10,000 stu- 
dents, and (c) all public school districts — no private or parochial 
schools were represented. These factors suggest that general- 
ization from this sample to all school districts in the United 
States is questionable. 

It is also possible that the variable(s) that could account 
for the variability in the AA to C comparison were not coded 
for this meta-analysis. Inclusion of additional descriptor vari- 
ables would be necessary for a more comprehensive synthe- 
sis, but this is related to the number of eligible studies that are 
available. If the sample of coded comparisons was larger and/ 
or more diverse (i.e., coming from more studies), it would 
have been possible to analyze more complex models. For ex- 
ample, the effect that publication year or state the district is 
in has on referral rates could have been examined. In addition, 
if the studies had reported the reason for referral more con- 
sistently, this variable might have been found to differentially 
affect the referral rates of racial groups. 

A third limitation is that most of the effect sizes were 
derived from a single research study (Gottlieb & Alter, 1994). 
If other variables may account for some of the variability 
among the effect sizes (e.g., publication year, publication type, 
school level), a larger, more diverse group of studies would 
be necessary to make the appropriate comparisons. Despite 
these limitations, however, there are clear implications for fu- 
ture research and practice. 


Implications for Future Research 

Although more research has focused on disproportionality in 
classification as disabled and placement in special education 
than on referral rates, the patterns identified in this meta- 
analysis are somewhat consistent with others reported in the 
literature. In order to make comparisons between referral and 
eligibility rates on a similar scale, a vast amount of additional 
research is needed to be able to compare groups of similar 
size. This could be achieved in two ways. First, the amount of 
available studies reporting referred sample and population fre- 
quencies could be increased. If researchers adopted inclusion 
of these data as standard practice in articles on dispropor- 
tionate representation in the referral process, a substantial 
body of literature would be developed over time. This liter- 
ature could then be examined not just for overall trends but 
also for intervening variables that might moderate the effects 
of disproportionate representation in referrals. 

Second, a large national database could be developed 
that included frequencies of referral and population for dif- 
ferent racial groups. The best-case scenario would be that the 
OCR Compliance Report or the U.S. Department of Educa- 
tion’s Annual Report to Congress could be restructured to in- 
clude these data. The same sample thus would be used for the 
referral and eligibility data, and a direct comparison would 
not require any generalization between samples. 

Another direction for future research is to disaggregate 
the data at a more specific level than nationally. Finn (1982) 
made comparisons not only at the national level but also by 
disaggregating the data by state. In general, he noted that dis- 
proportion was not an issue in the northeastern, midwest, or 
western states. The states that had the most severe dispropor- 
tionality were in the southeastern region of the United States, 
the region where most of the difficulty in developing inte- 
grated school systems in the 1960s occurred (Spring, 1997). 

Although Finn (1982) compared data at the state level 
and found some interesting trends, he built a good case for 
disaggregating at the district level as well. First, he argued, in 
data aggregated at the state level, large districts may over- 
shadow or obscure smaller districts. This could occur because 
a larger district contains a larger proportion of the state’s pop- 
ulation. It is conceivable that size of the district may affect 
trends in disproportionality. Second, districts with no students 
in a certain program will inflate the state’s population num- 
bers, reducing the appearance of disproportionality. Finn cited 
the example of the 1978 OCR survey, which showed only 12 
states in which all of the districts reported having students 
with MMR. By aggregating at the district level, analyses could 
be done to look at the effect of including or excluding these 
districts. Third, eligibility and placement, although regulated 
at the state level, are district-by-district processes. Just as 
Mercer, Jordan, Allsopp, and Mercer (1996) have shown that 
there is great variability among states in eligibility criteria, 
there is also variability within states on the criteria that dis- 
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tricts utilize. Aggregation of data at the district level would 
allow a closer investigation of this phenomenon and provide 
a better indicator as to which district-level variables may in- 
tervene in this area. 

A third direction for future research is to examine other 
variables that may affect disproportionate representation. Os- 
wald et al. (1999) took preliminary steps in this direction. 
Using the 1992 OCR database, they examined predictors of 
categorization as MR or ED. Their results, aggregated at the 
district level, indicate that environmental factors (e.g., median 
housing income, percentage of children in poverty) predict 
ED rate {R^ = .117) and MR rate {R^ = .356). The addition of 
race variables (percentage of African American enrollment and 
whether a student was African American or not) added sig- 
nificantly to the models (both ps < .0001). Another interest- 
ing finding is that the environmental/racial model was a better 
predictor for African American students than other students. 
Oswald et al. concluded that race-related factors influence the 
identification of African American students more than non- 
African American students. 

As suggested by Oswald et al. (1999), proportion of mi- 
nority students in the district is an important variable to con- 
sider. Although no relationship was found in the present study, 
this could be a result of the small sample (see above). How- 
ever, several other researchers (Finn, 1982; Gottlieb & Alter, 
1994; Harry, 1992) have noted this phenomenon and have doc- 
umented, to varying degrees, that disproportionate representa- 
tion is lower at the extremes. In other words, when the minority 
population in a school is a small percentage of the overall pop- 
ulation (less than 10%) or a very large percentage (over 90%), 
the eligibility rates for different minority groups are more sim- 
ilar than in schools with minority percentages between 10 and 
90. This trend does not hold true for small districts (those with 
total enrollment below 3000), however. In these districts, dis- 
proportion increases as minority percentage increases. 

Another variable to examine is the relation between SES 
and disproportionate representation. Ethnicity has been thought 
of as being a proxy for SES (MacMillan & Reschly, 1998). 
This is illustrated by the impact of poverty on a child’s life re- 
gardless of his or her race (Hodgkinson, 1995). As was shown 
by Oswald et al. (1999), environmental factors provide a 
strong predictor of identification. When Oswald et al. exam- 
ined the specific interaction between poverty variables in a 
district and race, distinct patterns emerged. For MR, African 
American students were overrepresented at all levels of each 
poverty variable (median household income, percentage of 
children in poverty, mean housing value). For ED, however, 
as indicators of poverty in a district decreased (i.e., SES in- 
creased), overrepresentation of African American students in- 
creased. This distinction between MR and ED rates lends 
support to (a) the differentiation between academic and be- 
havioral reasons for referral and (b) cultural tolerance as a 
contributing factor in behavioral referral. SES also needs to 
be considered at different levels of aggregation or influence. 
Anderson, Hollinger, and Conaty (1993) demonstrated that a 


school’s poverty level and individual poverty status both have 
significant and distinct influences on a student’s academic per- 
formance. Further research in this area is certainly warranted. 

Some areas that have not been studied to date also need 
further examination. The Oswald et al. (1999) study focused 
on district-level variables (because that was the level of ag- 
gregation of the data). An interesting comparison would be a 
hierarchical model of schools within districts and the addition 
of school-level variables. Also, because special education is 
intended to be an individual-level decision process, various 
student and teacher variables such as interaction patterns, be- 
havioral variables, and demographic variables may provide in- 
sight into the issue and potential strategies for addressing the 
problem of disproportionate representation. 

Conclusions 

Comparisons of the rate of referral for intervention or assess- 
ment of various racial and ethnic groups are practically absent 
from the research literature. Of the studies that have focused 
on referred samples, most do not report the proportions of 
each group in the population from which the sample was de- 
rived. In addition, most studies that have focused on potential 
bias in referral processes have relied on vignettes rather than 
actual referrals. Both of these methods, while providing much 
information and insight, are limited in their generalizability. 
Studies not reporting population data do not take into consid- 
eration possible differential processes for referral for various 
groups. Studies using vignettes may be capable of detecting 
overt cases of bias in attitudes, but they do not examine the 
actual practice of referral. 

This meta-analysis provides preliminary data regarding re- 
ferral rates as compared to population proportions. The results 
reflect what has been previously reported in narrative litera- 
ture reviews and analyses of the OCR database. Nevertheless, 
definite answers are precluded by the paucity of research re- 
porting the frequency counts necessary to quantitatively syn- 
thesize findings. Future research on bias in referred samples 
should include population demographics such as the fre- 
quency of the various groups being compared. Only then will 
the field be able to assess whether research on bias in refer- 
rals provides an accurate, overall picture of actual practice and 
determine where bias resides in the process and how it can be 
eliminated. 
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